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Abstract 

This paper considers state estimation of linear systems using analog amplify and forwarding with multiple 
sensors, for both multiple access and orthogonal access schemes. Optimal state estimation can be achieved at the 
£SJ ' fusion center using a time varying Kalman filter. We show that in many situations, the estimation error covariance 

decays at a rate of 1 /M when the number of sensors M is large. We consider optimal allocation of transmission 
powers that 1) minimizes the sum power usage subject to an error covariance constraint and 2) minimizes the 
error covariance subject to a sum power constraint. In the case of fading channels with channel state information 
the optimization problems are solved using a greedy approach, while for fading channels without channel state 
information but with channel statistics available a sub-optimal linear estimator is derived. 
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I. Introduction 

Wireless sensor networks are collections of sensors which can communicate with each other or to a central node 
or base station through wireless links. Potential uses include environment and infrastructure monitoring, healthcare 
qq ', and military applications, to name a few. Often these sensors will have limited energy and computational ability 
CO | which imposes severe constraints on system design, and signal processing algorithms which can efficiently utilise 

■ these resources have attracted great interest. 
In recent years there has been a considerable literature on estimation and detection schemes designed specifically 

for use in wireless sensor networks. Work on detection in wireless sensor networks include [1] which studies the 
asymptotic optimality of using identical sensors in the presence of energy constraints, and [2]-[4] which derives 
fusion rules for distributed detection in the presence of fading. Parameter estimation or estimation of constant 
signals is studied in e.g. [5]-[8] where issues of quantization and optimization of power usage are addressed. Type 

■ based methods for detection and estimation of discrete sources are proposed and analyzed in [9]-[ll]. Estimation 
of fields is considered has been considered in e.g. [12]— [14]. 

A promising scheme for distributed estimation in sensor networks is analog amplify and forward [15] (in 
distributed detection analog forwarding has also been considered in e.g. [16], [17]), where measurements from 
the sensors are transmitted directly (possibly scaled) to the fusion center without any coding, which is motivated by 
optimality results on uncoded transmissions in point-to-point links [18], [19]. (Other related information theoretic 
results include [20], [21].) Analog forwarding schemes are attractive due to their simplicity as well as the possibility 
of real-time processing since there is no coding delay. In [15] the asymptotic (large number of sensors) optimality 
of analog forwarding for estimating an i.i.d. scalar Gaussian process was shown, and exact optimality was later 
proved for a "symmetric" sensor network [22]. Analog forwarding with optimal power allocation is studied in [23] 
and [24] for multi-access and orthogonal schemes respectively. Lower bounds and asymptotic optimality results for 
estimating independent vector processes, is addressed in [25]. Estimation with correlated data between sensors is 
studied in [26], [27]. Other aspects of the analog forwading technique that have been studied include the use of 
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different network topologies [28], other multiple access schemes such as slotted ALOHA [29], and consideration 
of the impact of channel estimation errors [30] on estimation performance. 

Most of the previous work on analog forwarding have dealt with estimation of processes which are either constant 
or i.i.d over time. In this paper we will address the estimation of dynamical systems using analog forwarding of 
measurements. In particular, we will consider the problem of state estimation of discrete-time linear systems using 
multiple sensors. As is well known, optimal state estimation of a linear system can be achieved using a Kalman 
filter. Other work on Kalman filtering in sensor networks include studies of optimal sensor data quantization [31], 
Kalman filtering using one bit quantized observations [32] where performance is shown to lie within a constant 
factor of the standard Kalman filter, and estimation of random fields with reduced order Kalman filters [14]. Another 
related area with a rich history is that of distributed Kalman filtering, where the main objectives include doing 
local processing at the individual sensor level to reduce the computations required at the fusion center [33], [34], 
or to form estimates at each of the individual sensors in a completely decentralized fashion without any fusion 
center [35]. However in our work we assume that computational resources available at the sensors are limited so 
that they will only take measurements and then transmit them to the fusion center for further processing, using 
uncoded analog forwarding. 

Summary of Contributions: 

In this paper we will mainly focus on estimation of scalaiQ linear dynamical systems using multiple sensors, as the 
vector case introduces additional difficulties such that only partial results can be obtained. We will be interested in 
deriving the asymptotic behaviour of the error covariance with respect to the number of sensors for these schemes, 
as well as optimal transmission power allocation to the sensors under a constraint on the error covariance at the 
fusion center, or a sum power constraint at the sensor transmitters. We consider both static and fading channels, 
and in the context of fading channels, we consider various levels of availability of channel state information (CSI) 
at the transmitters and the fusion center. More specifically, we make the following key contributions: 

• We show that (for static channels with full CSI) for the multi-access scheme, the asymptotic estimation error 
covariance can be driven to the process noise covariance (which is the minimum attainable error) as the number 
of sensors M goes to infinity, even when the transmitted signals from each sensor is scaled by ^= (which 
implies that total transmission power across all sensors remains bounded while each sensor's transmission 
power goes to zero). This is a particularly attractive result since sensor networks operate in a energy limited 
environment. For the orthogonal access scheme, this result holds when the transmitted signals are unsealed, 
but does not hold when the transmitted signals are scaled by -h=. 

• The convergence rate of these asymptotic results (when they hold) is shown to be although it is seen via 
simulation results that the asymptotic approximations are quite accurate even for M = 20 to 30 sensors. 

• In the case of a small to moderate number of sensors, we derive a comprehensive set of optimal sensor 
transmit power allocation schemes for multi-access and orthogonal medium access schemes over both static 
and fading channels. For static channels, we minimize total transmission power at the sensors subject to a 
constraint on the steady state Kalman estimation error covariance, and also solve a corresponding converse 
problem: minimizing steady state error covariance subject to a sum power constraint at the sensor transmitters. 
For fading channels (with full CSI), we solve similar optimization problems, except that the error covariance 
(either in the objective function or the constraint) is considered at a per time instant basis, since there is no well 
defined steady state error covariance in this case. For the fading channel case with no CSI (either amplitude or 
phase), the results are derived for the best linear estimator which relies on channel statistics information and 
can be applied to non-zero mean fading channels. It is shown that these optimization problems can be posed 
as convex optimization problems. Moreover, the optimization problems will turn out to be very similar to 
problems previously studied in the literature (albeit in the context of distributed estimation of a static random 
source), namely [23], [24], and can actually be solved in closed form. 

• Numerical results demonstrate that for static channels, optimal power allocation results in more benefit for the 
orthogonal medium access scheme compared to the multi-access scheme, whereas for fading channels, it is 
seen that having full CSI is clearly beneficial for both schemes, although the performance improvement via the 
optimal power allocation scheme is more substantial for the orthogonal scheme than the multi-access scheme. 

The rest of the paper is organized as follows. Section JI] specifies our scalar models and preliminaries, and 

'By scalar linear system we mean that both the states and individual sensor measurements are scalar. 
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gives a number of examples between multi-access and orthogonal access schemes, which show that in general 
one scheme does not always perform better than the other. We investigate the asymptotic behaviour for a large 
number of sensors M in Section |III] Power allocation is considered in Section |IVJ where we formulate and solve 
optimization problems for 1) an error covariance constraint and 2) a sum power constraint. We first do this for 
static channels, before focusing on fading channels. In the case where we have channel state information (CSI) we 
use a greedy approach by performing the optimization at each time step. When we don't have CSI, we will derive 
a sub-optimal linear estimator similar to [36]-[38], which can be used for non-zero mean fading. Numerical studies 
are presented in Section [V] Extensions of our model to vector and MIMO systems is considered in Section |VlJ 
where we formulate the models and optimization problems, and outline some of the difficulties involved. 

II. Models and preliminaries 
Throughout this paper, i represents the sensor index and k represents the time index. Let the scalar linear system 

be 

x k+ i = ax k + w k 

with the M sensors each observing 

Vi,k = Cix k + v i>k , i = l,...,M 

with w k and Vi k being zero-mean Gaussians having variances and of respectively, with the wi fe's being 
independent between sensors. Note that the sensors can have different observation matrices a and measurement 
noise variances of, and we allow a and Cj to take on both positive and negative values. It is assumed that the 
parameters a, Cj,o^ and of are knownH Furthermore, we assume that the system is stable, i.e. \a\ < 1. 



A. Multi-access scheme 

In the (non-orthogonal) multi-access scheme the fusion center receives the sum 

M 

h = '^ J Ui,khi,kVi,k + n k (1) 
i=l 

where n k is zero-mean complex Gaussian with variance 2o^, h i:k are the complex-valued channel gains, and «j k 
are the complex-valued multiplicative amplification factors in an amplify and forward scheme. We assume that all 
transmitters have access to their complex channel state information (CSI)0 and the amplification factors have the 
form 



OLi,k = Oii >k -~- 

\h. 



h* 



.k 



where a>i jk is real-valued, i.e. we assume distributed transmitter beamforming. Defining hi )k = \hi jk \, z k = 
n k = R[n k ], we then have 

M 

z k = J ^2 a i,khi,kUi,k + n k (2) 
i=i 

Note that the assumption of CSI at the transmitters is important in order for the signals to add up coherently in 
In principle, it can be achieved by the distributed synchronization schemes described in e.g. [39], [40], but may not 
be feasible for large sensor networks. However, in studies such as [16], [39] it has been shown in slightly different 
contexts that for moderate amounts of phase error much of the potential performance gains can still be achieved. 
Continuing further, we may write 

M M 

z k = J ^2 a i,khi,kCiX k + ai,khi, k v ijk + n k = c k x k + v k 
i=i i=i 

2 We assume that these parameters are static or very slowly time-varying, and hence can be accurately determined beforehand using 
appropriate parameter estimation/system identification algorithms. 

3 The case where the channel gains are unknown but channel statistics are available is addressed in Section |IV-E| This can also be used 
to model the situation where perfect phase synchronization cannot be achieved [25]. 
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where c k = YliLi a i,kK,kCi an d v k = YliLi a i,kK,kVi tk + n k- Hence, we have the following linear system 

x k+ i = ax k + w k , z k = c k x k + v k (3) 

with v k having variance f k = YliLi a lk^ik a i + °n- Define the state estimate and error covariance as 

Xk+i\k = E{x k+ i\{z ,...,z k }} 

Pk+i\k = IE [(x fc+ i - x k+1 \ k ) 2 \{z , . . . ,z k }] 

where again P k+ i\ k is scalar. Then it is well known that optimal estimation of the state x k in the minimum mean 
squared error (MMSE) sense can be achieved using a (in general time-varying) Kalman filter [41]. Using the 
shorthand notation P k+ \ = P k +±\ k , the error covariance satisfies the recursion: 



We also remark that even if the noises are non-Gaussian, the Kalman filter is still the best linear estimator. 



B. Orthogonal access scheme 

In the orthogonal access scheme each sensor transmits its measurement to the fusion center via orthogonal 
channels (e.g. using FDMA or CDMA), so that the fusion center receives 

Zi,k = ai,khi,kVi,k + ni,k, i = l,...,M 
with the nj fe's being independent, zero mean complex Gaussian with variance 2a 2 L ,\/i. We will again assume CSI 

at the transmitters and use a i>k = a i>k -^-, with a i>k € R. Let h ijk = \h i>k \, z ijk = 3t[z itk ], n ijk = §ft[n ijfe ]. The 
situation is then equivalent to the linear system (using the superscript "o" to distinguish some quantities in the 
orthogonal scheme from the multi-access scheme): 

x k+ i = ax k + w k , z° k = C° k x k + \° k 



where 





Zl,k 




ai,khi,kCi 




oti,khi,kVi,k + ni,k 








> ^k = 




,n = 






_ ZM,k _ 




OZM,khM,kCM 




&M,khM,kVM,k + nu,k 


with the covariance of v° k being 
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The state estimate and error covariance are now defined as 

x° k+ i\k = ®[x k+1 \{z° ,...,z° k }] 
= E 



^k+i\k 



(%k+i - £fc+l|fc) 2 |{ z 0i • • • ) z fc} 



Optimal estimation of x k in the orthogonal access scheme can also be achieved using a Kalman filter, with the 
error covariance now satisfying the recursion: 



P° 

r k+l 



■ 2 P k - a 2 (P k ) 2 C° k (C° k P k C° k + R° k ) 1 C° k + a: 



where C° k and as defined above are respectively a vector and a matrix. To simplify the expressions, note that 



n° T (n r>or>o T n°\-ir i0 — k k *"k 
^k V^k^k^k + K fcJ *-fc 



l + P^Cl R u k CI 
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which can be shown using the matrix inversion lemma. Hence 



a 2 PZ 
1 + P k C k R k C k 



Pk+i — _ T fc _„-i — + a w (5) 



where one can also easily compute C° k C° k = Y^iLi a ? k^i k c i I \ a l k^l k°1 + °n)- The advantage of the 
orthogonal scheme is that we do not need carrier-level synchronization among all sensors, but only require 
synchronization between each individual sensor and the fusion center [24]. 



C. Transmit powers 

The power 7^ used at time k by the ith sensor in transmitting its measurement to the fusion center is defined as 
H,k = of k^iVikl- F° r stable scalar systems, it is well known that if {x k } is stationary we have E[x|] = 3^2, 
In both the multi-access and orthogonal schemes, the transmit powers are then: 



2 

2 I „2 u w I 2 



D. Steady state error covariance 

In this and the next few sections we will let h itk = hi (and hence hi :k = hi) , V/c be time-invariant, deferring the 
discussion of time-varying channels until Section |IV-D| We will also assume in this case that a^ k = ai,\/k, i.e. 
the amplification factors don't vary with time, and we will drop the subscript k from quantities such as c k and f k . 

From Kalman filtering theory, we know that the steady state (as k — > 00) error covariance P^ (provided it exists) 
in the multi-access scheme satisfies (c.f.©) 

a 2 Poof 2 //-N 
P « = *j£+^ + *« (6) 

where f and c are the time-invariant versions of f k and Cfe@ For stable systems, it is known that the steady state 
error covariance always exists [41, p. 77]. For c 7^ 0, the solution to this can be easily shown to be 

p _ (a 2 - l)f + c 2 a 2 w + ygg - l)f + ggg + gjjp 

^00 - 2 _ 2 (/) 

In the "degenerate" case where c = 0, we have P^ = cr^/(l — a 2 ). It will also be usful to write (O as 



with 5 = c 2 /f regarded as a signal-to-noise ratio (SNR). We have the following property. 
Lemma 1: P^ as defined by ([8]) is a decreasing function of S. 

Proof: See the Appendix. ■ 
Similarly, in the orthogonal access scheme, the steady state error covariance P^ satisfies (c.f.©) 

n 2 P° 

P °oo = -J?- -^ +°l (9) 

1 + P£C R° C° 

where R° and C° are the time-invariant versions of R^ and C° k . We can easily compute C° R° C° = Y^aLx a lhl c i I ( a l^l a i^~ 

— T _ — 1 _ 

<7 2 ) with S° = C° R° C° regarded as a signal-to-noise ratio. The solution to Q can then be found as 



p o = a 2 - 1 + o 2 w S° + yV - 1 + aj Soy + 4aj g 

25° ' 



Lemma 2: P^ as defined by (fTOb is a decreasing function of S" 3 
The proof is the same as that of Lemma Q] in the Appendix. 



4 The assumption of time-invariance is important. For time- varying rt and Ck, the error covariance usually will not converge to a steady 
state value. 
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Comparing (O and ( fTOb we see that the functions for P^ and P^ are of the same form, except that in the 
multi-access scheme we have 



and in the orthogonal scheme we have 



qo — r< oT D 1 r<° _ \^ 

E. Some examples of multi-access vs orthogonal access 

A natural question to ask is whether one scheme always performs better than the other, e.g. whether S > S° 
given the same values for aj, /ij, q, cr 2 , a 2 are used in both expressions. We present below a number of examples 
to illustrate that in general this is not true. Assume for simplicity that the «j's are chosen such ajCj are positive 
for all i = 1,...,M. 

1) Consider first the case when a 2 = 0. Then we have the inequality 

1 a 2 h 2 c 2 (E£i«iW 



which can be shown by applying Theorem 65 of [42]. So when <r 2 = 0, S° > S and consequently P^ will be 
smaller than P^. The intuitive explanation for this is that if there is no noise introduced at the fusion center, 
then receiving the individual measurements from the sensors is better than receiving a linear combination of the 
measurements, see also [43]. 

2) Next we consider the case when the noise variance a 2 is large. We can express S — S° as 

. At At 



—, n i . / ajhjCj) 2 TT (afhfaf + a 2 ) 

*M „2i,2_2 , _2\TTAf /„ 2i,2_2 , _2\\ v /— ' ! 1 1; ll l 1 ! 1 nl 



M At 
2t2 2/V^ 2t2 2 i 2\ TT / 2>2 2 , 2\ 2 r2 2 /V^ 2j2 2 , 2n TT /" 2r2 2 , 2 

- "l^lCll^^^^ + °n) 11 KMi + °"J ^Mh M c M{Z^ a i h i a i + a n) H K^O"* +^ 

i=l i:i7^1 i=l iii^M 

The coefficient of the (cr 2 ) M term in the numerator is (^YlfLi otihiC^j — a 2 h 2 c 2 — ■ ■ ■ — o? M h? M <? M > 0. For a 2 
sufficiently large, this term will dominate, hence S > S° and the multi-access scheme will now have smaller error 
covariance than the orthogonal scheme. 

3) Now we consider the "symmetric" situation where on = a,Ci = c, a 2 = a 2 , hi = h,\/i. Then we have 

M 2 a 2 h 2 c 2 Ma 2 h 2 c 2 „ n Ma 2 h 2 c 2 

and S 



Ma 2 h 2 a 2 + a 2 a 2 h 2 a 2 + al/M a 2 h 2 a 2 + o 2 n 

Hence S > S°, with equality only when a 2 = (or M = 1). Thus, in the symmetric case, the multi-access scheme 
outperforms the orthogonal access scheme. 

4) Suppose a 2 / 0. We wish to know whether it is always the case that S > S° for M sufficiently large. The 
following counterexample shows that in general this assertion is false. Let on = 1, hi = 1, a 2 = 1, Vi Let M/2 of 
the sensors have a = 1, and the other M/2 sensors have Cj = 2. We find that 

(M/2 + M) 2 _9 M !C o_ M1 + 4 _ 5 M 

o — — ~ o — — t o — 7~~~ and O 



M + cj 2 41 + a 2 /M 2 I + a 2 21 + a 2 

If e.g. cr 2 = 1/8, then it may be verified that S° > S for M < 10, S° = S for M = 10, and 5 1 > S° for M > 10, 
so eventually the multi-access scheme outperforms the orthogonal scheme. On the other hand, if 2 (i+ct 2 ) > f or 
<7 2 < 1/9, we will have S° > S no matter how large M is. 
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III. Asymptotic behaviour 

Since P^ is a decreasing function of S (similar comments apply for the orthogonal scheme), increasing S will 
provide an improvement in performance. As S — > oo, we can see from © that P^ — ► of,, the process noise 
variance. Note that unlike e.g. [15], [24] where the mean squared error (MSE) can be driven to zero in situations 
such as when there is a large number of sensors, here the lower bound of, on performance is always strictly greater 
than zero. When the number of sensors is fixed, then it is not too difficult to show that S will be bounded no matter 
how large (or small) one makes the aij's, so getting arbitrarily close to ct 2 is not possible. On the other hand, if 
instead the number of sensors M is allowed to increase, then — > <r^ as M — > oo can be achieved in many 
situations, as will be shown in the following. Moreover we will be interested in the rate at which this convergence 
occurs. 

In this section we will first investigate two simple strategies, 1) cm = l,Vi, and 2) Qj = 1/y/M, Vio For the 
"symmetric" case (i.e. the parameters are the same for each sensor) we will obtain explicit asymptotic expressions. 
We then use these results to bound the performance in the general asymmetric case in Section IIII-CI Finally, we 
will also investigate the asymptotic performance of a simple equal power allocation scheme in Section IIII-Di We 
note that the results in this section assume that large M is possible, e.g. ability to synchronize a large number of 
sensors in the multi-access scheme, or the availability of a large number of orthogonal channels in the orthogonal 
scheme, which may not always be the case in practice. On the other hand, in numerical investigations we have 
found that the results derived in this section are quite accurate even for 20 — 30 sensors, see Figs. Q] and [2] in Section 

El 



A. No scaling: a.{ = l,Vi 

Let ai = l,Vi, so measurements are forwarded to the fusion center without any scaling. Assume for simplicity 
the symmetric case, where Cj = c, of = a 2 , hi = h, Vi. 

In the multi-access scheme, c = Mhc, and has variance f = Mh 2 a 2 + of, so that S = ]Jh 2 a 2 +a 2 ■ Since 
S — > oo as M — > oo, we have by the previous discussion that P^ — ► of,. The rate of convergence is given by the 
following: 

Lemma 3: In the symmetric multi-access scheme with = l,Vi, 



c 2 M \ M 2 



<4 + ^T7 + 0(^] (ID 



as M — > oo. 

Proof: See the Appendix. ■ 
Thus the steady state error covariance for the multi-access scheme converges to the process noise variance of,, 
at a rate of 1/M. This result matches the rate of 1/M achieved for estimation of i.i.d. processes using multi-access 
schemes, e.g. [15], [44]. 

In the orthogonal scheme we have S° = ^2^2 , so S° — > 00 as M — > 00 also. By similar calculations to the 
proof of Lemma [3] we find that as M — ► 00 

p o 2 , a 2 (h 2 al + al) 1 / 1 \ 2 a 2 (a 2 + a 2 Jh 2 ) 1 M\ 

Therefore, the steady state error covariance again converges to a 2 ^ at a rate of 1/M, but the constant a ^"^"^ ^ 
in front is larger. This agrees with example 3) of Section III-EI that in the symmetric situation the multi-access 
scheme will perform better than the orthogonal scheme. 



B. Scaling on = 1/yM, Vi 

In the previous case with a, = l,Vi, the power received at the fusion center will grow unbounded as M — » 00. 
Suppose instead we let = l/y/M, Vi, which will keep the power received at the fusion center bounded (and 

5 These strategies are similar to the case of "equal power constraint" and "total power constraint" in [44] (also [16]), and various versions 
have also been considered in the work of [15], [23]-[25], in the context of estimation of i.i.d. processes. 
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is constant in the symmetric case), while the transmit power used by each sensor will tend to zero as M — > oo. 
Again assume for simplicity that q = c, of = of, hi = h,\/i. 

In the multi-access scheme we now have S = Jf\ ,° 2 . so that as M — > oo, 

, c?(ol + <,l/h?) i n 



Thus we again have the steady state error covariance converging to the process noise variance of, at a rate of 
1/M. In fact, we see that this is the same expression as (fT2l) in the orthogonal scheme, but where we were using 
cti = l,Vi The difference here is that this performance can be achieved even when the transmit power used by 
each individual sensor will decrease to zero as the number of sensors increases, which could be quite desirable in 
power constrained environments such as wireless sensor networks. For i.i.d. processes, this somewhat surprising 
behaviour when the total received power is bounded has also been observed [25], [44]. 

In the orthogonal scheme we have S° = h2(7 t/^j +(T 2 > and we note that now 5° is bounded even as M — > oo, so 
P^ cannot converge to a^, as M -> oo. For a more precise expression, we can show by similar computations to 
the proof of Lemma [3] that for large M, 



pQ _ (a 2 - l)of + hW w + V(« 2 - 1)M + 2(a 2 + l)o 2 h 2 c 2 o 2 + Wot 



2h 2 c 2 

+ 



a 2 - l)a 2 ^ (a 2 + l)h A a 2 v c 2 a 2 w + (a 2 - l) 2 ^ 2 ^ 2 



2c 2 2/i 2 cV(a 2 - 1)M + 2(a 2 + l)a 2 h 2 c 2 a 2 + h^ai 



m + °(m 2 



(14) 



Noting that — — ^ ^E^? — ^ <T u>' steady state error covariance will con- 

verge as M — > oo to a value strictly greater than a 2 ^, though the convergence is still at a rate 1/M. Analogously, 
for i.i.d. processes it has been shown that in the orthogonal scheme the MSE does not go to zero as M — > oo when 
the total power used is bounded [24]. 



C. General parameters 



The behaviour shown in the two previous cases can still hold under more general conditions on Cj, of and hi. 
Suppose for instance that they can be bounded from both above and below, i.e. < c m j„ < |cj| < c max < oo, 
< a min - a i - a max < oo, < h min < hi < h max < oo,Vi. We have the following: 

Lemma 4: In the general multi-access scheme, as M — > oo, using either no scaling of measurements, or scaling 
of measurements by l/y/M, results in 



i^o =- rri + () [ _L 



-r { / I 

In the general orthogonal scheme, using no scaling of measurements results in 

P° = a 2 + o(— 

00 w \ M 

as M — > oo, but does not converge to a limit (in general) as M —>■ oo when measurements are scaled by 
1/y/M. 

Proof: See the Appendix. ■ 

D. Asymptotic behaviour under equal power allocation 

When the parameters are asymmetric, the above rules will in general allocate different powers to the individual 
sensors. Another simple alternative is to use equal power allocation. Recall that the transmit power used by each 
sensor is % = a 2 (c 2 ^^- + erf). If we allocate power 7 to each sensor, i.e. 7, = 7,Vi, then 



a, = V«+.?(i-« 2 ) <15) 
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If instead the total power "/total is to be shared equally amongst sensors, then ji = "/total and 

/ ItotalQ- ~ a 2 ) n/ . s 
1 - V M(cM + o?(l-a')) (16) 

Asymptotic results under equal power allocation are quite similar to Section UlI-CI namely: 

Lemma 5: In the general multi-access scheme, as M — ► oo, using the equal power allocation (fT3T ) or (fT6l ) results 

in 

In the general orthogonal scheme, using the equal power allocation (fT31) results in 

as M — > oo, but does not converge to a limit as M —* oo when using the power allocation (fT6l ). 

Proof: See the Appendix ■ 



£. Remarks 

1) Most of the previous policies in this section give a convergence rate of 1/M. We might wonder whether one 
can achieve an even better rate (e.g. 1/M 2 ) using other choices for oti, though the answer turns out to be no. To 
see this, following [15], consider the "ideal" case where sensor measurements are received perfectly at the fusion 
center, and which mathematically corresponds to the orthogonal scheme with a 2 = 0, on = = l,Vi. This 
idealized situation provides a lower bound on the achievable error covariance. We will have S° = E^cf/af, 
which can then be used to show that P£, converges to a 2 , at the rate 1/M. Hence 1/M is the best rate that can be 
achieved with any coded/uncoded scheme. 

2) In the previous derivations we have not actually used the assumption that \a\ < 1, so the results in Sections 
IIII-AI - ITH-Cl will hold even when the system is unstable (assuming C ^ 0). However for unstable systems, E[ac?] 
becomes unbounded as k — > oo, so if the cafe's are time invariant, then more and more power is used by the sensors 
as time passes. If the application is a wireless sensor network where power is limited, then the question is whether 
one can choose these cafe's such that both the power used by the sensors and the error covariances will be bounded 
for all times. Now if there is no noise at the fusion center, i.e. = 0, then a simple scaling of the measurements at 
the individual sensors will work. But when ^ 0, as will usually be the case in analog forwarding, we have not 
been able to find a scheme which can achieve this. Note however that for unstable systems, asymptotic results are 
of mathematical interest only. In practice, in most cases, we will be interested in finite horizon results for unstable 
systems where the system states and measurements can take on large values but are still bounded. In such finite 
horizon situations, one can perform optimum power allocation at each time step similar to Section IIV-DI but for 
a finite number of time steps, or use a finite horizon dynamic programming approach similar to Section IIV-D.4I 
However these problems will not be addressed in the current paper. 



IV. Optimal power allocation 

When there are a large number of sensors, one can use simple strategies such as on = 1/ vM, Vi, or the equal 
power allocation (fT6l ). which will both give a convergence of the steady state error covariance to a 2 , at a rate of 
1/M in the multi-access scheme, while bounding the total power used by all the sensors. However when the number 
of sensors is small, one may perhaps do better with different choices of the c^'s. In this section we will study some 
relevant power allocation problems. These are considered first for static channels in the multi-access and orthogonal 
schemes, in Sections IIV-AI and IIV-BI respectively. Some features of the solutions to these optimization problems 
are discussed in Section IIV-CI These results are then extended to fading channels with channel state information 
(CSI) and fading channels without CSI in Sections IIV-DI and IIV-EI respectively. 
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A. Optimization problems for multi-access scheme 

1) Minimizing sum power: One possible formulation is to minimize the sum of transmit powers used by the 
sensors subject to a bound D on the steady state error covariance. More formally, the problem is 

M M f j,„2 

nun 

i=i i=i 
subject to Pqo < D 

with Pqo given by $7}. Some straightforward manipulations show that the constraint can be simplified to 



M M / 2 2 \ 



f (a 2 D + a 2 w -D)+ c 2 D(a 2 w - D) < (17) 



i.e. 

/ M 



$k 2 o\ + a 2 n (a 2 D + a 2 w -D) + [J2 D(a 2 w - D) < 




\i=l 

Now define s = h\C\cti + • • • + Kmcmolm- Then the optimization problem becomes 

A / C 2 G 2 

„2 L i°w ,2 



mm > a,- | - + a, 

a lt ...,a M ,s 

i=l 

/ M \ M 



(18) 



,CV; 



subject to a i h i a i + a n ( o2 -° + <?l ~ D ) < s 2 D(D - a^) and s = ^ TijCj 

Vi=l / i=l 

Before continuing further, let us first determine some upper and lower bounds on D. From Section [Till a lower 
bound is D > a^, the process noise variance. For an upper bound, suppose c = so we don't have any information 
about Xfc. Since we are assuming the system is stable, one can still achieve an error covariance of (just let 

&k = 0, V&), so D < j^j- Hence in problem ([TBI both D — a 2 ^ and a 2 D + a 2 ^ — D axe, positive quantities. 
To reduce the amount of repetition in later sections, consider the slightly more general problem 



M 



mm y^cqKi 
oct,...,a M ,s 

1=1 (19) 

/ M \ M v ' 

subject to afn + cr 2 x < s 2 y and s = ctipi 

\i=l / i=l 

where x > 0, y > 0, Ki > 0, pj G R, Tj > 0, i = 1, . . . , M are constants. In the context of (TT8l . x = a 2 £> + — D, 
y = D(D - al), pi = hid, n = h 2 a 2 and m = (J&jt + afj for i = 1, . . . , M. 

The objective function of problem (fT9l is clearly convex. Noting that ri,c7 2 ,x and y are all positive, the set of 
points satisfying f X)i=i T i a i + a n) x = V s2 * s tnen a <3 ua dric surface that consists of two pieces, corresponding 

to s > and s < 0@ Furthermore, the set of points satisfying f J^i=i r « a ? + a n) x — V 8 " 2 an ^ s > 0, and the set 

of points satisfying (^2i=i T i a i + °"n) x ^ 2/ s2 an ^ s < 0, are both known to be convex sets, see e.g. Prop. 15.4.7 
of [45]. Hence the parts of the feasible region corresponding to s > and s < are both convex, and the global 
solution can be efficiently obtained numerically. Furthermore, following similar steps to [23], a solution in (mostly) 
closed form can actually be obtained. We omit the derivations but shall summarise what is required. 
One first solves numerically for A the equation 

M \ 2 1 

y v 2 i 

f-f Ki + Xnx y 

6 In three dimensions this surface corresponds to a "hyperboloid of two sheets". 
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Since the left hand side is increasing with A solutions to this equation will be unique provided it exists. Taking 
limits as A — » oo, we see that a solution exists if and only if 

M 2 

£->- (20) 

Equation d20l) thus provides a feasibility check for the optimization problem (fT9l ). In the context of (fT8l ). one can 
easily derive that (1201) implies 2~2iLi %k > ^gi^y > which indicates that the sum of the sensor signal to noise 
ratios must be greater than a threshold (dependent on the error covariance threshold D) for the optimization problem 
dT8b to be feasible. 

Next, we compute n from 

/ M 2 \ _1 

,2 _ „1 ,. / ^ _ Pi '''< 1 



^4A(k; + At;x) 2 ^ 
Finally we obtain the optimal a^'s (denoted by a*) 



with the resulting powers 



ol* = . W \ -,i = l,...,M. (21) 

2{Ki + ATiX) 

*2 *2 ( 2 2\ • -i ji.r 

7i = «j Ki = Oi I C- 1 _ q2 + CTj 1 , 1 = 1, . . . , M 



Note that depending on whether we choose n to be positive or negative, two different sets of a*'s will be obtained, 
one of which is the negative of the other, though the 7j's and hence the optimal value of the objective function 
remains the same. 

Another interesting relation that can be shown (see [23]) is that the optimal sum power satisfies 

M 

Itotal = £ °^ Ki = X(J n X ( 22 ) 
i=l 

This relation will be useful in obtaining an analytic solution to problem (l23l next. 

2) Minimizing error covariance: A related problem is to minimize the steady state error covariance subject to 
a sum power constraint itotal- Formally, this is 

min Poo 

M 

9 / C? ™ 9 \ 

Itotal 



m / c 2 a 2 \ 

subject to £ q? I ji-*L + af J < 

i=i ^ ' 



with Pqo again given by $7}. For this problem, the feasible region is clearly convex, but the objective function 
is complicated. To simplify the objective, recall from Lemma Q] that Pqo is a decreasing function of S = c 2 /f. 
Thus maximizing c 2 /f (or minimizing f/c 2 ) is equivalent to minimizing P^, which has the interpretation that 
maximizing the SNR minimizes P^. Hence the problem is equivalent to 



mm , 



M / 2 2 \ M 

subject to ^2 Q i ( 7 % _ W 2 + a i) ^ "Itotal and s = £ /w , 

i=l ^ ' i=l 



We again introduce a more general problem 

EM 2 i 2 
i=l QfTi + < 



mm ., 

Hi Om|S 

M M 

subject to £ a^Ki < j tota i and s = £ 
i=l »=i 



(23) 
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with x > 0,y > 0, Ki > 0, pi EM,Ti > 0,i = 1, . . . ,M being constants. The objective function is still non-convex, 
however by making use of the properties of the analytical solution to problem ( fT9l ), such as the relation (l22l) . an 
analytical solution to problem d23l ) can also be obtained. The optimal aj's can be shown to satisfy: 

/ m p2 y 1 2 

af = Jtotal V 7 ; rp^Kj — ^ — ^ Kj (24) 

\ ^ ( K j + ItotalirY J («i + Ttoia*^-) 2 

The details on obtaining this solution are similar to [23] and omitted. 



B. Optimization problems for orthogonal access scheme 

1 ) Minimizing sum power: The corresponding problem of minimizing the sum power in the orthogonal scheme 

is 

M M / 2 2 



mm 



/ r 2 rr 2 \ 



subject to < D 

with P^q now given by (ITOb - By a rearrangement of the constraint, this can be shown to be equivalent to 



M / 2 2 \ 

min Wf-^ + a?) 



(25) 



^a\h\o1 + ol- D(D — a. 



Note that in contrast to the multi-access scheme, we now write the minimization over af rather than c^. Since each 
of the functions 

-afhfc 2 _ -cf crlcf/a 2 
<4h 2 a 2 + a 2 a 2 a 2 h 2 a 2 + a 2 

is convex in a 2 , the problem will be a convex optimization problem in (a 2 , . . . ,a 2 M ). Note that without further 
restrictions on aj we will get 2 M solutions with the same values of the objective function, corresponding to the 
different choices of positive and negative signs on the c^'s. This is in contrast to the multi-access scheme where 
there were two sets of solutions. For simplicity we can take the solution corresponding to all «j 

An analytical solution can also be obtained. To reduce repetition in later sections, consider the more general 
problem 



M 

mm > a 2 Kj 



E' 



M 99 

a tPt ^ x 

subject to > 1 1 > - 

i af n + ai y 



(26) 



i- 



where x > 0, y > 0, Ki > 0,pi G R, T{ > 0,i = 1,...,M are constants and have similar interpretations as in 
Section IIV-A. 1 1 Since the derivation of the analytical solution is similar to that found in [24] (though what they 
regard as is af here), it will be omitted and we will only present the solution. 
Firstly, the problem will be feasible if and only if 

M 9 

y^p[ > E 

T~i y 

i=l 1 y 

7 In general this is not possible in the multi-access scheme. For instance, if we have two sensors with c\ being positive and c-z negative, 
the optimal solution will involve ai being positive and a2 negative, or vice versa. Restricting both Qi's to be positive in the multi-access 
scheme will result in a sub-optimal solution. 
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Interestingly, this is the same as the feasibility condition d20b for problem ( fT9l ) in the multi-access scheme, indicating 
that the total SNR for the sensor measurements must be greater than a certain threshold (dependent on D). The 
optimal aj's satisfy 

where is the function that is equal to x when x is positive, and zero otherwise. To determine A, now assume 
that the sensors are ordered such that 

d > ... > dt. 

Ki ~ ~ K M 

. . ^2 ^2 

Note that in the context of problem (1251 ). ^ = a2 y^_ a 2^ +0 .a y e a ■ Clearly, this ordering favours the sensors with better 
channels and higher measurement quality. Then the optimal values of af (and hence a*) can also be expressed as 



af 



, otherwise 



where 



%r^Mi Pi _ X 



and the number of sensors which are active, Mi (which can be shown to be unique [6]), satisfies 



Mi 2 S^ Ml \pi\ H~J2 I „2 „2 spM t +l \p t \ /~~Z2 f~2 ~ 

fe^ E&g-f V £i=\ +1 #-§ V «m 1+1 



2 J Minimizing error covariance: The corresponding problem of minimizing the error covariance in the orthogonal 
scheme is equivalent to 

M *ma 



mm 



E 



*?,...,< £j a? /if a? + al 



M / c 2 a 2 \ 

subject to a? f \ + af\ < 7,,,,,,, 

8=1 V ° / 



which is again a convex problem in {a\ , . . . , a|^). For an analytical solution [24], consider a more general problem 



mm / — j 

M < 28 > 

subject to ^ a?«j < 7 totai 
»=i 

where x > 0, y > 0, > 0, pi G R, Tj > 0, i = 1, . . . , M are constants. Then the optimal c^'s satisfy 



Tj \ V Xk 

Assuming that the sensors are ordered so that 

d. > ... > ?k 

K\ ~ ~ KM 

„2 



2 - 1 I -Itfrt al I . (29) 



the optimal values of af to problem ( 1281 ) can also be expressed as 



a f = \ i(V©-^) , i<Mi 
I , otherwise 
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where 

1 ™ _i_ V^-Aii k, 2 

1 _ 7tota« + Z^i=l ~ °n 

and the number of sensors which are active, Mi (which is again unique), satisfies 



, v^Mi k,- 2 / 2 9 i v^Mi+1 k,- 2 I 2 9 

T^ + E^n _ ^2 > Q and Tfertaf + E^i _ a2 



C. Remarks 

1) In the orthogonal scheme, the solutions of the optimization problems (l26l ) and (128T ) take the form (127T ) and 
(|29l ) respectively. These expressions are reminiscent of the "water-filling" solutions in wireless communications, 
where only sensors of sufficiently high quality measurements will be allocated power, while sensors with lower 
quality measurements are turned off. On the other hand, the solutions for problems (fT9l ) and (l23T > have the form (12T1 ) 
and (l24l respectively, which indicates that all sensors will get allocated some non-zero power when we perform 
the optimization. The intuition behind this is that in the multi-access scheme some "averaging" can be done when 
measurements are added together, which can reduce the effects of noise and improve performance, while this can't 
be done in the orthogonal scheme so that turning off low quality sensors will save power. 

2) The four optimization problems we consider (problems (fT9l ), (|23T ). (l26l) and (l28l) ) have analytical solutions, 
and can admit distributed implementations, which may be important in large sensor networks. For problem ( fT9l ) the 
fusion center can calculate the values A and [i and broadcast them to all sensors, and for problem d23l the fusion 

{ M P 2 \ ~* 

center can calculate and broadcast the quantity ( Ej=i ( K+7t t \ T - /a 2 ) 2 K i) to all sensors. The sensors can then 
use these quantities and their local information to compute the optimal a^s, see [23]. For problems (f26l ) and (T28T ). 
the fusion center can compute and broadcast the quantity A to all sensors, which can then determine their optimal 
aj's using A and their local information, see [24]. 



D. Fading channels with CSI 

We will now consider channel gains that are randomly time-varying. In this section we let both the sensors and 
fusion center have channel state information (CSI), so that the h^k's are known, while Section IIV-EI considers 
fading channels without CSI. We now also allow the amplification factors ± to be time-varying. 

1 ) Multi-access: Recall from (0]) that the Kalman filter recursion for the error covariances is 



2 



Pk+1 - -2 p , - + °"w 

where c k = Ei^i ai.fc^i.&Cj and r k = Y^iU a i,k h i,k a i + a n- 

One way in which we can formulate an optimization problem is to minimize the sum of powers used at each 
time instant, subject to P k +i\k < D at all time instances k. That is, for all k, we want to solve 

M M 

mm 



ivi ivi , 2 2 s 

i=l i=l ^ ' 

0?Pf.ff. n 

subject to P k+1 = _ +a^<D 

ctPk + r k 



(30) 



The constraint can be rearranged to be equivalent to 

f k (a 2 P k + a 2 w -D)+ c\Pk{al - D) < 

which looks rather similar to (fTTT ). In fact, once we've solved the problem (1301 at an initial time instance, e.g. 
k = 1, then P 2 = D is satisfied, so that further problems become essentially identical to what was solved in 
Section IIV-A. 1 1 Therefore, the only slight difference is in the initial optimization problem, though this is also 
covered by the general problem (|T9b - 
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Another possible optimization problem is to minimize P k +i\k at eacn ti me instant subject to a sum power 
constraint ^totai at each time k, i.e. 



. a 2 Pkr k . 2 

mm P k+ i = _ 2 + a w 



iPk + r k 



c k 

M / 2 2 

subject to a lk ( l l °_ W 2 + a i ) ^ Ttotai 
i=l ^ a 



(31) 



As we can rewrite the objective as 



a 2 Pkf k /4 2 



Pk + ffc/cg 

it is clear that minimizing the objective function is equivalent to minimizing f k jc\. So at each time step we 
essentially solve the same problem (|23l considered in Section [IV-A.2I while updating the value of P k +\ every time. 

2) Orthogonal access: Recall from (f5]) that in the orthogonal scheme, the Kalman filter recursion for the error 
covariance is: 

2 po 

po _ Q r k , 2 

i + p° k c° k R° k c° k 

If we wish to minimize the sum power while keeping P k+l < D at all time instances, the constraint becomes 

t o-^o_^ «l^\A >a ^ + al-D 



^a 2 ,^2 2 + 2 - pom -a, 

l i,k i,k i ' n k \ 



If we wish to minimize -P^ +1 at each time instance subject to a sum power constraint at all times /c, then this is 
the same as maximizing 

M rv 2 h 2 r 2 

^k K fc C fc - 2^ a 2 ^2 2, 2 

i=l a i,k' l i,k a i ^ a n 

In both cases, the resulting optimization problems which are to be solved at each time instant are variants of 
problems (|26l ) and (|28l ), and can be handled using the same techniques. 

3) Remarks: As discussed in Section [IV-CI these problems can be solved in a distributed manner, with the fusion 
center broadcasting some global constants that can then be used by the individual sensors to computer their optimal 
power allocation. The main issue with running these optimizations at every time step is the cost of obtaining channel 
state information. If the channels don't vary too quickly one might be able to use the same values for the channel 
gains over a number of different time steps. However if the channels vary quickly then estimating the channels at 
each time step may not be feasible or practical. In this case we propose one possible alternative, which is the use 
of a linear estimator that depends only on the channel statistics, and which will be derived in Section IIV-EI 

4) A dynamic programming formulation: The optimization problems we have formulated in this section follow 
a "greedy" approach where we have constraints that must be satisfied at each time step, which allows us to use the 
same techniques as in Sections IIV-AI and IIV-BI The motivation behind this follows from the monotonic properties 
of the solution to the Riccati equations (0]) or ([5]). An alternative formulation is to consider constraints on the long 
term averages of the estimation error and transmission powers. For instance, instead of problem (|3"TT ). one might 
consider instead the infinite horizon problem: 

1 T 

mm^lim^-^ElPfc+i] 
k=l 



subject to lim ^S^HS^hM < 1 total 



k=l i=l 

where we wish to determine policies that will minimize the expected error covariance subject to the average sum 
power being less than a threshold Jtotal- Solving such problems will require dynamic programming techniques, and 
would involve discretization of the optimization variables similar to [46], where optimal quantizers were designed 
for HMM state estimation over bandwidth contained channels using a stochastic control approach. This approach 
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is however highly computationally demanding. A thorough study of these problems is beyond the scope of this 
paper and is currently under investigation. 



E. Fading channels without CSI 

Suppose now that CSI is not available at either the sensors or fusion center, though channel statistics are available^ 
The optimal filters in this case will be nonlinear and highly complex, see e.g. [47]. An alternative is to consider 
the best linear estimator in the minimum mean squared error (MMSE) sense, based on [37]. In our notation, the 
situation considered in [37] would be applicable to the model Xk+i = axk + Wk, = otkhkCXk + Vk- While this is 
not quite the same as the situations that we are considering in this paper, their techniques can be suitably extended. 

1 ) Multi-access scheme: Since we do not have CSI we cannot do transmitter beamforming and must return to 
the full complex model (Q]). We will also restrict d^ = dj,VA; to be time invariant. The main difference from [37] 
is that the innovations is now defined as 

Assuming that the processes {hi^},i = 1,...,M are i.i.d. over time, with real and imaginary components 
independent of each other, and {hi^} independent of {w^} and {t^fc}, i = 1, . . . , M, the linear MMSE estimator 
for scalar systems can then be derived following the methods of [37] (also see [48]) as follows: 







. 3[5 fc ] _ 





x k+l\k 
■ffc-1 



a%k\k 

a2 Pk\k 
%k+l\k + -FjH 



CP, 



fc+i[fc» 



where C 



R = 



^fc+l|fc+l = x k+l\k + ^k+l\k l 

Pk+l\k+l = p k+l\k - P k+l\k 

[ E£iE[»[ai^]]cj ZZl^MaMfo 
Var[5ft[dA]]g| + E[K 2 [dA]K? 



R 



-i 



(»[z fc 



+ ij,9[%.+i]) T - Cx fcH 



i|fe 



(32) 



C-Pfc+i|fcC T + R 



and 



Var[9[d^]]gJ +E[Z 2 [d i h i ]}o-! 



2_a=l 



+ < 



using the shorthand $t 2 [X] = ($t[X]) 2 and 9= 2 [X] = (^[X]) 2 . 

These equations look like the Kalman filter equations but with different C and R matrices, so much of our 
previous analysis will apply@ For instance, since the estimator is not using the instantaneous time-varying channel 
gains but only the channel statistics (which are assumed to be constant), there will be a steady state error covariance 

given by 

(a 2 - 1) + a 2 w S + v/(a 2 - 1 + a 2 5) 2 + 4a 2 S 



2S 



with S = C T R _1 C. Note that for circularly symmetric fading channels e.g. Rayleigh, we have C = [ ], and 
estimates obtained using this estimator will not be useful^ Thus we will now restrict ourselves to non-zero mean 
fading processes. Motivated by transmitter beamforming in the case with CSI, let us use amplification factors of 
the form 

(E[W 



ft; 



on 



\E[hi]\ 



with ftj G R. Then S simplifies to 



S 



Var[sft[dA]]c; 



i l-a 1 



+ E[^[dih l ]]a 2 ) +a, 



We note that this can also be used to model the situation where the sensors are not perfectly synchronized [25]. 

'in fact one can regard it as an "equivalent" linear system (with a stable dynamics and stationary noise processes) along the lines of [48]. 
10 Other work where there are difficulties with circularly symmetric fading include [9], [25], [44]. A possible scheme for estimation of 
i.d. processes and zero-mean channels which can achieve a 1/logAf scaling has been proposed in [44]. 
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where we can find 



Emaihi}} = ai\E[hi 



Var^d;^]] 



a; 



E*[9?/ii]Var[9Wii] + E 2 [3/i;]Var[Q/i. 



|E[/k]|2 

2 

E^fdA]] = fE 2 pi^ i ]E[» 2 ^] + 2E 2 [»/i i ]E 2 [9tfi i ] +E 2 [9fo i ]E[3 3 /l i 



(33) 



|EN| 2 

using the shorthand E 2 [X] = (E[X\) 2 ,$t 2 [X] = (K[X]) 2 and % 2 [X] = (9[X]) 2 . If the real and imaginary 
parts are identically distributed, we have the further simplifications Var[3ft[dj/tj]] = a 2 Var[5ft/ij] and E[3ft 2 [dj/ij]] = 
a?(E[K 2 ^]+E 2 [^]). 

Power allocation using this sub-optimal estimator can then be developed, and the resulting optimization problems 
(which are omitted for brevity) will be variants of problems ([T9l ) and d23l ). We note however that the optimization 
problems will only need to be run once since C and R are time-invariant quantities, rather than at each time instance 
as in the case with CSI. 

Since we have a steady state error covariance using this estimator, asymptotic behaviour can also be analyzed 
by using the techniques in Sections JII] The details are omitted for brevity. 

2) Orthogonal access scheme: For orthogonal access and no CSI, the equations for the linear MMSE can also 
be similarly derived and will be of the form (l32l ). substituting C° in place of C, R° in place of R, etc. We have 



C° = [ E[«[ai^i]]ci E[3[ai7ii]]ci 



K[R[a M h M ]}cM E[S[a M hM]]c-M 
R° n ... 



and 



R 







R 



MM 



with each R°- being a block matrix 

Var^y c 2 ^ + E[^ 2 [d t h t ]}af + a 2 n 



R 



Var[9f[^^]]cf + E[3f 2 [dA]]<x 2 + a 



There will be a steady state error covariance given by 



po 



with 5° = C° R° C°. If we choose d, 



M 



1) + a 2 w S° + V(« 2 - 1 + o- 2 w S°) 2 + Aa 2 w S° 



2S° 



Mi*])- 



a.% \ J,Vi'' then S° can be shown to be 

\E[hi]\ 



Evenki]]* 



i=i 



Var^d^Jc 2 ^ + E[W[aihi]]<T? + a 2 



where we also refer to (1331) for further simplifications of these quantities. 

Asymptotic behaviour and optimal power allocation can also be analyzed using the techniques in Sections [TTT] 
and IIV-BI respectively, and the details are omitted for brevity. 



V. Numerical studies 

A. Static channels 

First we show some plots for the asymptotic results of Section |III] In Fig. Q] (a) we plot P M vs M in the multi- 
access scheme for the symmetric situation with an = \l\f~M, and a = 0.8, a 2 = 1.5, <r 2 = 1, c = 1, a 2 = 1, h = 0.8. 

2 / !2 i 2 / 2 \ 

We compare this with the asymptotic expression a 2 w + a ^ f"' — '-jj from ( [TBI . Fig. Q] (b) plots the difference 

between P^ — a 2 ^, and compares this with the term a av+ c 2 n — - -h. We can see that P^ is well approximated by 
the asymptotic expression even for 20-30 sensors. 

In Fig. |2] we plot vs M in the multi-access scheme with cq = l/yM,a = 0.9, er^ = 1,<t 2 = 1 and values 
for Cj, a 2 , hi chosen from the range 0.5 < Cj < 1,0.5 < Ri < 1,0.5 < h{ < 1. We also plot the (asymptotic) lower 
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Asymptotic expression 



20 40 60 80 100 



Asymptotic expression - o 2 



Fig. 1. Comparison between and asymptotic expression: Multi-access scheme with cti = 1/yM 




p .-< 

Lower bound - c 2 

Upper bound - G 2 



80 100 



Fig. 2. Poo with general parameters and bounds: Multi-access scheme with a.i = 1/yM 



and upper bounds (l37l ) from the proof of Lemma |4] o\, + a ^ h ^ a ^ Jr(y ^ -L and cr^ + a ( h ™$* a <^* +(r ^ -L . it can 
be seen that does indeed lie between the two bounds, both of which converge to at the rate 1/M. 

Next we look at the numerical results for optimal power allocation. In Fig. [3] we compare between using optimal 
power allocation and equal power allocation for the multi-access scheme. We use a = 0.9, a 2 = 10~ 9 ,cj^, = 
l,Cj = l,Vi The sensor noise variances af are drawn from a x 2 (l) distribution to model the differences in sensor 
measurement quality. The channel gains hi are modelled as d^ 2 , with representing the distance of sensor i to 
the fusion center. We use distances uniformly drawn between 20m and 100m. In Fig. [3ja) we keep D = 2, while 
in Fig. [3fb) we keep jtotai = 10 -3 . Each of the data points represent the average over 1000 realisations of the 
sensor parameters (i.e. Q,of ,<ij). In Fig. [4] the comparison using the same parameters and parameter distributions 
is shown for the orthogonal scheme. What can be observed is that as the number of sensors M increases there is a 
general trend downwards for both graphs, though optimal power allocation seems to provide more benefits in the 
orthogonal access scheme than the multi-access scheme. 



B. Fading channels 

In Fig. |5]we compare between the full CSI and no CSI situations for the multi-access scheme, using a = 0.9, <r^ = 
10 _9 ,<7^, = l,Cj = l,Vi, and af drawn from a x 2 (l) distribution. The complex channel gains hi^s are chosen to 
be Rician distributed with distance dependence. Specifically, the real and imaginary parts of hi f. are chosen to be 
distributed as d^ 2 x N(fj,i, 1), with di uniform between 20 and 100, and fa uniform between 1/2 and 1. In Fig. 
I2a) we keep D = 2, and in Fig. [2b) we keep ^totai = 10~ 3 . In the full CSI case the values are averaged over 
1000 time steps for each set of sensor parameters (i.e. Ci,af,di, /Xj), and in the no CSI case they are the steady 
state values using the linear MMSE estimator (l32l . The results are then repeated and further averaged over 100 
realisations of the sensor parameters. In Fig. [6] we make the same comparison for the orthogonal scheme. We can 
see in Fig. [5] that for the multi-access scheme the performance loss in the case without CSI is not too great when 
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Fig. 3. Multi-access. Comparison between optimal and equal power allocation schemes, with (a) an error covariance constraint and (b) a 
sum power constraint 




5 10 15 20 5 10 15 20 

M M 



Fig. 4. Orthogonal access. Comparison between optimal and equal power allocation schemes, with (a) an error covariance constraint and 
(b) a sum power constraint 



compared to the case with full CSI. Thus even if one has full CSI, but doesn't want to perform power allocation 
at every time step, using the linear MMSE estimator d32l ) instead could be an attractive alternative. On the other 
hand, for the orthogonal scheme in Fig. [6] there is a more significant performance loss in the situation with no CSI. 

VI. Extension to vector states and MIMO 

In Section IVI-AI we formulate a possible extension of our work to vector state linear systems. We outline some 
of the differences and difficulties that will be encountered when compared with the scalar case. In Section |VI-B I we 
consider a situation similar to a MIMO system, where the fusion center has multiple receive antennas (and each 
sensor operating with a single transmit antenna), and we show how they can be written as an equivalent vector 
linear system. 

A. Vector states 

We consider a general vector model 

x fc+ i = Ax fc + w fc 

with x G M. n , A 6 M nxn , and £ W 1 being Gaussian with zero-mean and covariance matrix Q. For a stable 
system all the eigenvalues of the matrix A will have magnitude less than 1. The M sensors each observe 

y ijk = Qx fc + \ iik , i = 1, . . . , M 

with y i k G W 71 , Q £ M. mxn , and Vj k £ M m being Gaussian with zero-mean and covariance matrix R,. We assume 
that each of the individual components of the measurement vectors k are amplified and forwarded to a fusion 
center via separate orthogonal channels^ We will consider real channel gains for simplicity. 

"Another possibility is to apply compression on the measured signal [7], [23], so that the dimensionality of the signal that the sensor 
transmits is smaller than the dimension of the measurement vector, but for simplicity we will not consider this here. 
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Fig. 6. Orthogonal access. Comparison between the full CSI and no CSI situations, with (a) an error covariance constraint and (b) a sum 
power constraint 



In the multi-access scheme the fusion center then receives 

M 

i=l 

where ct^fc G R mxm is a matrix of amplification factors, Hj^ G R mxm a matrix of channel gains, and G M m 
is Gaussian with zero-mean and covariance matrix N. We can express the situation as 

x fc+ i=Ax fe +Wfc, z k = C k x k + \ k 

where C k = Y^iLi W-i,k<*i,h£u Vfe = YsiU ^i,kOL i)k y itk +n k , with \ k having covariance matrix R k = YhLi ^,fc a i,fc R * a i|fc H Mr 
N. The error covariance updates as follows: 

P fc+ i = AP fc A T - AP fc Cfc (C k V k C T k + Rk)- 1 ^^ + Q 

The transmit power of sensor i at time k is 

-y hk = Tr(a i>k E\y k jl]al k ) 

= TrK fc (QE[x fc x£]Cf + Ri)o£ fc ) 

where Tr(») denotes the trace, and E[xfcX^] satisfies (see [41, p. 71]) 

E[x fe x^] - AE[x k xl]A T = Q 

In the static channel case, the steady state error covariance satisfies 

Poo = AP oc A T - AP 00 C T (CP 00 C T + R^CPooA 7 + Q 
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However, unlike the scalar case where the closed form expression (|7]) exists, in the vector case no such formula for 
Poo is available, and thus asymptotic analysis is difficult to develop. For time-varying channels, we can pose similar 
optimization problems as considered in Section [TV] For instance, minimization of the error covariance subject to a 
sum power constraint can be written as: 



min Tr(P fc+1 ) 

a lik ,...,a M ,k 

M 



subject to ^2(a iik (CiE[x k xl]Cf + Ri)al k ) < j total 



(34) 



i=l 



This problem is non-convex, and unlike the scalar case does not appear to be able to be reformulated into a 
convex problem. Similar problems have been considered previously in the context of parameter estimation, and 
sub-optimal solutions were presented using techniques such as deriving bounds on the error covariance [27], and 
convex relaxation techniques [23]. 

In the orthogonal access scheme the fusion center receives 



We can express the situation as 
by defining 



Zi,fc = H^a^fcy^ + ni )k , i = l,...,M 
Xfc+i = Ax fc + w fc , z° k = C° k x k + \° k 









r° - 


_ z M,k 




i% being 





Hi^ai^Ci 

Hm, k a M,kC M 



R 



H 1)fc a lifc R lQ ^H^ + N 












H-l,k a l,kiVl,k + n l,fc 
Hjw ,k a M, k y M ,k + n M,k 








H 2 ,fcQ;2,fcR2Ct2,fc H 2,fc + N ••• 

• • • H M ^ k aM,k^MOr^ 4 fc H^ f k 



N 



The error covariance updates as follows: 

P° k+1 = AP£A T - APgcf (C£Pgcf + K)- l C° k n^ T + Q 

_ T — — T 

The term C k (C^P^C^ + K k )~ l C k can be rewritten using the matrix inversion lemma as 

r° T (r v°r° T ^v \- l r — r oT ~a ~ l r° T r , ° T p ~ 1 r , ° T /'P " 1 _i_ r° T it ' 1 r° T \- 1 r° T ii ~ 1 r° 7 

^k K^k^k^k + K feJ ^k — ^k K fc ^k ~ ^k K fc ^k l r fc "+" ^k K fc ^k ) ^k K fc ^k 

where we have the simplification 



M 



k K fc ^k 



^(Hi !fc Qi ifc Q) T (H iifc Q! iifc RiQ!^ A .H^ +N) 1 (H i)fc citi ) feC i ) 



(35) 



Minimization of the error covariance subject to a sum power constraint can be written as: 

min Tr(P£ +1 ) 

a lik ,...,a M ,k 

M 

subject to ^2(a i:k (CiE[x k xl]Cf + Ri)al k ) < j tota i 
i=i 

This problem is non-convex and also does not appear to be able to be reformulated into a convex problem. In the 
context of parameter estimation with sensors communicating to a fusion center via orthogonal channels, a similar 
problem was considered in [49], and was in fact shown to be NP-hard, although sub-optimal methods for solving 
that problem were later studied in [7]. 

As the techniques involved are quite different from what has currently been presented, a comprehensive study 
of optimization problems such as (l34l and (|35T ) is beyond the scope of this paper and will be studied elsewhere. 
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B. MIMO situation 

One could also consider a situation resembling the MIMO systems in wireless communications, with the different 
sensors (each with a single transmit antenna) representing the multiple transmitters, and multiple receive antennas 
at the fusion center. It turns out that these situations can be expressed as equivalent vector linear systems. We will 
show how this is done for a simple case. Consider the vector state, scalar measurement system 

x k+1 = Ax fc + Wfe, y it k = CiX k + v i>k , i = l,...,M 

where Cj,Vz are 1 x n vectors. We will look at the orthogonal access scheme, but now with L receive antennas at 
the fusion center. The fusion center then receives from each sensor 

z i,k = [h\ k &i,kyi,k + nj >k , . . . , hf k a ijk y iik + nf >k ] T ', i = 1, . . . , M 

where h\ k is the channel gain from the i-th sensor to the j-th antenna. Defining 

Zl,fc ] 

— r 1 T L T 1 T L T 1 

z k = : , Cfc = [h^ai^^ | . . . \h l k ai^ k Ci \ I^M,fc a M,feC M | . . . \h Mjk aM,k c M\ 

_ z M,k 

Vk = [h\ )k ai t kVi t k + n\ k, • • ■ , hi k ax jk vx )k + nf fc , h} M ^a.M,kVm,k + «M,fc> • • • > ^Af,fc a m,fc u m,& + n^- jfc ] T 

we may then write the situation as the vector system: 

x fc+ i = Ax fc + w fc , z k = C k x k + \ k 

Other variations of the MIMO setup, e.g. vector sensor measurements, can be similarly transformed into equivalent 
vector linear systems. Note that for scalar state and scalar measurements per sensor, one could use similar techniques 
to Section II-B for problem formulation and those of Sections IV-B and IV-D for the optimal power allocation 
results. However, as described in Section I VI- A I difficulties in analyzing general vector systems will still remain. 

VII. Conclusion 

This paper has investigated the use of analog forwarding in the distributed estimation of stable scalar linear 
systems. We have shown a 1/M scaling behaviour of the error covariance in a number of different situations, and 
formulated and solved some optimal power allocation problems for both static and fading channels. We have also 
outlined extensions to vector linear systems and MIMO systems. Further study of these extensions and related 
problems will form the topics of future investigations. 



Appendix 



A. Proof of Lemma [7] 
Rewrite ((8]) as 



(a 2 ~ 1) 1 fi / (a 2 -!) 2 1 E + MI + < 
2 5 2 V 4 S 2 2 5 4 



Taking the derivative with respect to S we get 

dP^ a 2 -I I (a 2 -l) 2 ^ + {a 2 + l)a 2 ± 



dS 2 S 2 , / (a 2 -!) 2 1 , (a 2 +l)aj 1 ~~ 
4 V 4 3^ + 2 S + 



To show that — 0> ^ * s sufficient to show that 



2 



(a 2 -!) 2 ^ + (a 2 + l)cx 2 ^ ) ^ fa 2 - 1 1 



> 



(a 2 -!) 2 1 (a 2 +l)o- 2 1 ai J \ 2 S 2 

4 S 2 + 2 S + 4 



Expanding and rearranging, this is equivalent to 



(a 2 ~ l) 4 ^ + 2(a 2 - l) 2 (a 2 + l)al ± + (a 2 + 1) 2 ^-1 

> {a 2 ~ I) 4 ^ + 2(a 2 - l) 2 (a 2 + l)a 2 -I + (a 2 - l) 2 ^ 



or (a + 1) er* > (a — 1) cr^, which is certainly true. 
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B. Proof of Lemma \3\ 

We first substitute the simplified expressions for c and f into (0): 

p — 
1 \ — 



(a 2 - l){Mh 2 a 2 + op + M 2 h 2 c 2 a 2 w + ^((a 2 - \){Mh 2 a 2 + gj + M 2 /i 2 c 2 ^) 2 + 4M 2 fe 2 cV 2 (M/i 2 q 2 + gj 

2M 2 /i 2 c 2 

Regarded as a function of M, we are interested in the behaviour of P m as M — > oo. Now 

V((a 2 - + a 2 ) + M 2 /i 2 c 2 cr 2 ) 2 + 4M 2 /i 2 c 2 <t 2 (M/i 2 a 2 + a 2 ) 

= (hW w M A + 2(a 2 - l)a 2 v h 4 c 2 a 2 w M 3 + ih 4 c 2 a 2 w a 2 M 3 + 0(M 2 )) 1/2 

. kW ( 1+ ^±M + o(^)) 1/2 



^ CT ™ M ( 1+ 2 cV 2 M + °{W 



= h l c l a l w M l + {a 2 + l)h 2 a 2 M + 0(1) 

where we have used the expansion (1 + x) 1 / 2 = 1 + x/2 + 0(x 2 ) for \x\ < 1 [50, p. 15], which is valid when M 
is sufficiently large. Hence 



Poo = °l + ~^^7 + O 



a 2 al 1 



M V M2 



C. Proof of ' Lemma ^\ 

We first prove the statements for the multi-access scheme. We have Mh m i n c m i n < J2i=i ^iCi < Mh ma xCm a x and 
^^rnin a rnin — Z^i ^ 2 °f — ^^max a max- R ecai l fr° m Lemma Q] that P^ is a decreasing function of S = c 2 /f. 
If we choose a-i E {+1, —1} such that ajCj is positive for all i, we have 

M/i 2 ■ a 2 ■ + a 2 f Mh 2 a 2 + a 2 

mm mtfl 1 n <^ __ max^max 1 w n 



/Vf2/,2 c 2 - c 2 ~ M 2 /) 2 r 2 

""max^max ° "mm mm 

and by a similar calculation to d36l ) we can show that as M — » oo 

,2i,2 „2 / 1 \ ^2l2 ^2 



n 2 /7 2 rr 2 1 / 1 \ n 2 /) 2 n- 2 1 / 1 \ 

~~P ^ M V M 2 / w ft 2 c 2 M I M 2 / 

u max L max lvl \ JlJ / n min c min lvl \ lvl / 



If instead we choose Oj G {1/yM, — 1/vM} such that c^q is positive for all i, we can similarly show that as 

M -> oo 



.. . .,2 ^2 i ^2 



a (^min a min + °n) ^ , -q ^ ^ ^ p ^ _2 , Q (^mm ff m<a + a n) 1 , n f /o 7 \ 

/i 2 c 2 M Uf 2 / " /i 2 c 2 M \ M 2 I { n 

u max c max lvl \ nl / n min L min 1VI J 



mm mm 

2 



In either case, as the upper and lower bounds both converge to a 2 ^ at a rate of 1/M, P^ itself will also do so. 

For the orthogonal scheme, a similar argument to the above shows that choosing Oj G {+1, —1} gives convergence 
of Pgc to a 2 ^ at the rate 1/M for general parameters. 

To show that P^ in general does not converge to a limit as M — > oo, when using the scaling l/\/M in the 
orthogonal scheme, consider the following example. Suppose there are two distinct sets of "symmetric" parameters 
with behaviour as in (fl4l) . such that if all the sensors had the first set of parameters the error covariance would 
converge to P^ v and if all the sensors had the second set of parameters the error covariance would converge 
to P£q 2 , with P^ 2 7^ P^ 1 . Then let the first Mi sensors have the first set of parameters, the next M 2 (with 
M2 » Mi) sensors the second set, the next M3 (with M3 >> M2) sensors the first set, the next M4 (with 
M4 >> M3) sensors the second set, etc... Then P£, would alternate between approaching P^ 1 and P^ 2 , and will 
not converge to a limit as M ^00. 
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D. Proof of Lemma\5\ 

With the multi-access scheme and the allocation (fT5l ). by defining 

7(1 -a 2 ) _ a 7(1 -a 2 ) 



a, 



- 2 • a 2 
mm tn 



r 2 ■ 

ruin 



(I" 

we can show similar to the proof of Lemma [4] that 



v 2 ■ h 2 ■ a 2 ■ 

mm mm mm 



M 2 



h2 pi 

^max ' L max ^max 



< 



■-max " w 



a 2 + (T 2 

^1/1 i v rr?. 



,(l 



-^^max^max^max ®r< 

M 2 a 2 h? c 2 

mm mm mm 



Hence as M 



<rf„ + 



oo we have 
a 2 a 2 ■ h 2 ■ a 2 ■ 1 

mm mm mm -4-0 



y-y2 ^2 ^2 

"max ' "max ^max 



M 



\M 2 



< Poo < <C + 



a 2 a 2 /i 2 a 2 



a 2 • /i 2 • c 2 • 

mm mm mm 



M 



The other cases can be treated similarly as in the proof of Lemma 0] 
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